The translation of examples, citations, definitions and glosses in the Papillon project

Author

  • Christian BOITET
Abstract

The Papillon lexical data base comprises a set of detailed monolingual dictionaries of « lexies » (word senses) interlinked through « axies » (interlingual links), which can also refer to external semantics-oriented systems such as UNL « universal words », WordNet « synsets », Ontos « concepts » or NTT ALT/JE system « semantic classes ». The basic idea is that bilingual or multitarget usage dictionaries can be generated ad libitum from the data base. This implies that examples, citations, definitions and glosses expressed in each language be translated into all other languages and stored in the data base. Storing can be achieved in a simple and « seamless » way by introducing « auxiliary » lexies and axies for these « free language elements ». But translating all these elements into all languages is a major subproject of the Papillon project. We propose to use the mutualization feature of the Papillon server and help voluntary contributors perform or postedit translations using a shared, web-oriented translation workbench based on the « Montaigne » architecture. We also propose to use freely available UNL web sites to get first drafts of translations, thereby attaching full UNL graphs to auxiliary axies. In the future, a « coedition » technique, still at the research stage, could be used to improve the UNL graphs a posteriori and transparently from any language, and get improved translations in all target languages.

Keywords: Papillon multilingual data base, N-N translation of dictionary information, Montaigne architecture, interlingual representation, coedition of text & UNL graph
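The storage scheme described in the abstract (lexies interlinked through axies, with « auxiliary » lexies and axies holding free language elements) can be illustrated with a minimal Python sketch. All class, field and method names here (`Lexie`, `Axie`, `LexicalDB`, `unl_graph`, `translations`) are hypothetical and chosen for illustration only; they are not taken from the actual Papillon schema or server.

```python
from dataclasses import dataclass, field
from typing import Dict, List, Optional, Set


@dataclass
class Lexie:
    """A monolingual entry: a word sense, or a free language element if auxiliary."""
    lexie_id: str
    language: str
    text: str
    auxiliary: bool = False  # True for examples, citations, definitions, glosses


@dataclass
class Axie:
    """An interlingual link grouping lexies that express the same content."""
    axie_id: str
    lexie_ids: Set[str] = field(default_factory=set)
    auxiliary: bool = False
    unl_graph: Optional[str] = None  # assumed slot for an attached UNL graph


class LexicalDB:
    """Hypothetical in-memory store; not the real Papillon data base."""

    def __init__(self) -> None:
        self.lexies: Dict[str, Lexie] = {}
        self.axies: Dict[str, Axie] = {}

    def add_lexie(self, lexie: Lexie) -> None:
        self.lexies[lexie.lexie_id] = lexie

    def link(self, axie_id: str, lexie_id: str, auxiliary: bool = False) -> None:
        # Create the axie on first use, then attach the lexie to it.
        axie = self.axies.setdefault(axie_id, Axie(axie_id, auxiliary=auxiliary))
        axie.lexie_ids.add(lexie_id)

    def translations(self, lexie_id: str, target_language: str) -> List[str]:
        # Follow every axie containing the lexie and collect the texts of
        # the other linked lexies written in the target language.
        found = []
        for axie in self.axies.values():
            if lexie_id in axie.lexie_ids:
                for other_id in axie.lexie_ids:
                    other = self.lexies[other_id]
                    if other_id != lexie_id and other.language == target_language:
                        found.append(other.text)
        return sorted(found)
```

In this sketch an example sentence and its translation are stored exactly like word senses: each becomes an auxiliary lexie, and one auxiliary axie links them, so a bilingual view can be generated by following axies from the source-language lexie to lexies in the target language.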


Similar resources

Approaches to enlarge bilingual corpora of example sentences to more languages in memoriam

In a multilingual richly structured lexical data base such as Papillon, examples, citations, definitions and glosses expressed in each language have to be translated into all other languages and stored into the data base. Storing can be achieved in a simple and « seamless » way by introducing « auxiliary » lexies and axies for these « free language elements ». Translating all these elements into...


On The Effects of Literal Translation, L1 Glosses and Context, Applied in Reading Comprehension, on Iranian EFL Learners Vocabulary Learning: The Case of Different Proficiencies

This paper aims at discovering and investigating the effect of L1 translation, L1 gloss and context on vocabulary learning on EFL Iranian learners. A total number of 120 EFL students in private English institutes in Sari participated in the present study. They were divided into two proficiency groups and three learning conditions. In order to make a list of words unknown to the learners the partici...


The Feasibility and Applicability of Grounded Theory Methodology in Translation Studies: Developing Translator Competence

The development of sociological approach to the study of translation makes it possible for the researchers to adopt different methodology to develop new theoretical formulations and concepts. These formulations are arrived at through the interaction with those being studied through the interpretation of real social world and meanings of the participants involved in the translation/social event....


Low-Cost Enrichment of Spanish WordNet with Automatically Translated Glosses: Combining General and Specialized Models

This paper studies the enrichment of Spanish WordNet with synset glosses automatically obtained from the English WordNet glosses using a phrase-based Statistical Machine Translation system. We construct the English-Spanish translation system from a parallel corpus of proceedings of the European Parliament, and study how to adapt statistical models to the domain of dictionary definitions. We bui...


The Spanish version of WordNet 3.0

In this paper we present the Spanish version of WordNet 3.0. The English resource includes the glosses (definitions and examples) and the labelling of senses with WordNet identifiers. We have translated the synsets and the glosses to Spanish and alignment has been carried out at word level, whenever possible. The project has produced two interesting results: we have obtained a bilingual (Spanis...




Publication date: 2002